docs: aviary, verifiers, reasoning gym env integration docs #617
Conversation
Signed-off-by: Christian Munley <[email protected]>
> ## Rollout Collection
>
> ### Start vLLM Server
Can we follow the quickstart pattern of using a hosted model to generate rollouts?
```bash
echo "policy_base_url: https://api.openai.com/v1
policy_api_key: your-openai-api-key
policy_model_name: gpt-4.1-2025-04-14" > env.yaml
```
Unnecessary burden to get a model and serve it with vLLM, right?
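If the hosted-model pattern is adopted, the same `env.yaml` could also be written with a heredoc, which avoids quoting pitfalls in a multi-line `echo`. A sketch, reusing the file name and keys from the snippet above (the API key is a placeholder):

```shell
# Write the hosted-model endpoint config suggested in the review.
# Heredoc equivalent of the multi-line echo; values copied from the
# review snippet, with the API key left as a placeholder.
cat > env.yaml <<'EOF'
policy_base_url: https://api.openai.com/v1
policy_api_key: your-openai-api-key
policy_model_name: gpt-4.1-2025-04-14
EOF
# Sanity check: show what was written
cat env.yaml
```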
> ## Example Usage
>
> ### GSM8K Environment
Unlike my comment on Reasoning Gym (https://github.com/NVIDIA-NeMo/Gym/pull/617/changes#r2800137030), here we do not have the "setup steps" before running `ng_run`. We need one pattern and should follow it consistently.
> ---
>
> ## Start Model Server
Similar comment to the other env tutorials about self-hosting a model raising the barrier to entry and about consistency with the quickstart.
Also, this one doesn't include the instruction to pull weights from HF.
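For parity with the env tutorials that do pull weights, the missing step might look roughly like the following docs fragment (the model id is a placeholder, not taken from the PR; `huggingface-cli download` and `vllm serve` are the standard commands):

```bash
# Sketch of the missing weight-pull step; <org>/<model> is a placeholder
huggingface-cli download <org>/<model> --local-dir ./model
vllm serve ./model
```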
> ## Start Model Server
>
> ```bash
> uv add vllm
> ```
Are we going with pip or uv? The Reasoning Gym env has `pip install`: https://github.com/NVIDIA-NeMo/Gym/pull/617/changes#diff-ada604f88b18e8dbff44f513c28f5aad984dc5e3bbbd213d4c1aadd9214350f9R64
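Whichever tool the docs standardize on, the two commands are interchangeable for installing the `vllm` package. One way to sketch a tolerant snippet is to prefer `uv` when it is on `PATH` and fall back to `pip` (this detection helper is hypothetical, not part of the docs under review):

```shell
# Pick whichever installer the reader's machine has; either command
# installs the same vllm package. The command is echoed here rather
# than executed, since installation needs network access.
if command -v uv >/dev/null 2>&1; then
  INSTALL_CMD="uv add vllm"
else
  INSTALL_CMD="pip install vllm"
fi
echo "$INSTALL_CMD"
```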